Title: A powerful non-parametric statistical framework for family-based association analyses
نویسندگان
چکیده
The Framingham Heart Study dataset used for the analyses was obtained from the National Center for Biotechnology Information database of genotypes and phenotypes (NCBI dbGaP) through accession number phs000128.v3.p3. ABSTRACT Family-based study design is commonly used in genetic research. It has many ideal features, including being robust to population stratification (PS). With the advance of high-throughput technologies and ever-decreasing genotyping cost, it has become common for family studies to examine a large number of variants for their associations with disease phenotypes. The yield from the analysis of these family-based genetic data can be enhanced by adopting computationally efficient and powerful statistical methods. We propose a general framework of family-based U-statistic, referred to as family-U, for family-based association studies. Unlike existing parametric-based methods, the proposed method makes no assumption of the underlying disease models, and can be applied to various phenotypes (e.g. binary and quantitative phenotypes) and pedigree structures (e.g., nuclear families and extended pedigrees). By only using within-family information, it can offer robust protection against PS. In the absence of PS, it can also utilize additional information (i.e., between-family information) for power improvement. Through simulations, we demonstrated that family-U attained higher power over a commonly used method, FBAT, under various disease scenarios. We further illustrated the new method with an application to a large-scale family data from Framingham Heart Study. By utilizing additional information (i.e., between-family information), family-U confirmed a previous association of CHRNA5 with nicotine dependence.
منابع مشابه
A powerful nonparametric statistical framework for family-based association analyses.
Family-based study design is commonly used in genetic research. It has many ideal features, including being robust to population stratification (PS). With the advance of high-throughput technologies and ever-decreasing genotyping cost, it has become common for family studies to examine a large number of variants for their associations with disease phenotypes. The yield from the analysis of thes...
متن کاملA Non-parametric Control Chart for Controlling Variability Based on Squared Rank Test
Control charts are used to identify the presence of assignable cause of variation in the process. Non-parametric control chart is an emerging area of recent development in the theory of SPC. Its main advantage is that it does not require any knowledge about the underlying distribution of the variable. In this paper a non-parametric control chart for controlling variability has been developed. I...
متن کاملRegression Modeling for Spherical Data via Non-parametric and Least Square Methods
Introduction Statistical analysis of the data on the Earth's surface was a favorite subject among many researchers. Such data can be related to animal's migration from a region to another position. Then, statistical modeling of their paths helps biological researchers to predict their movements and estimate the areas that are most likely to constitute the presence of the animals. From a geome...
متن کاملSOCR Analyses: Implementation and Demonstration of a New Graphical Statistics Educational Toolkit.
The web-based, Java-written SOCR (Statistical Online Computational Resource) tools have been utilized in many undergraduate and graduate level statistics courses for seven years now (Dinov 2006; Dinov et al. 2008b). It has been proven that these resources can successfully improve students' learning (Dinov et al. 2008b). Being first published online in 2005, SOCR Analyses is a somewhat new compo...
متن کاملImprovement of mapping accuracy by unifying linkage and association analysis.
It is well known that pedigree/family data record information on the coexistence in founder haplotypes of alleles at nearby loci and the cotransmission from parent to offspring that reveal different, but complementary, profiles of the genetic architecture. Either conventional linkage analysis that assumes linkage equilibrium or family-based association tests (FBATs) capture only partial informa...
متن کامل